Block compression of tables with repeated values

ABSTRACT

Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.

BACKGROUND

The present disclosure relates to data processing by digital computer,and more particularly to block compression of tables with repeatedvalues.

Search engines may search large amounts of data in database tables, suchas relational tables, to find results. For massive amounts of data, suchas a combination of tables containing millions of records, processing ofthe data may require lots of hardware resources. For example, largeamounts of random access memory space may be needed to store all therecords that are relevant to executing a user request.

SUMMARY

The subject matter disclosed herein provides methods and apparatus,including computer program products, that implement techniques relatedto block compression of tables with repeated values.

In one aspect, a column of data is compressed in accordance withdictionary-based compression to generate a column of value identifiers,the value identifiers are sorted, a list of block identifiers isgenerated, columns of block dictionaries are generated, and a blockoffset column is generated. For each block of the value identifiers,there is a unique block identifier for each unique value identifier andthere are like block identifiers for like value identifiers. For eachblock dictionary, there is a list of block identifiers, where each blockidentifier is associated with a value identifier, and there is a blockidentifier existing in a block dictionary for each unique value of theblock identifiers. Each value of the block offset column indicates anoffset at which a block starts in the columns of block dictionaries.

In a related aspect, value identifiers representing a compressed columnof data are sorted, and a block dictionary is generated. A blockdictionary may be generated for each block of the value identifiers.Each of the block dictionaries may include a list of block identifiers,where each block identifier is associated with a value identifier andthere is a block identifier for each unique value in a block.

The subject matter may be implemented as, for example, computer programproducts (e.g., as source code or compiled code tangibly embodied incomputer-readable media), computer-implemented methods, and systems.

Variations may include one or more of the following features.

The value identifiers may be values representing structured businessdata having data dependencies across a same row of a table. The businessdata may include business objects modeled as sets of joined tables.

Block dictionaries, block vectors, and the like may be generated inparallel on multiple hardware servers.

Changes to a column of data may be stored in a delta buffer separatefrom the column of the data and the changes may be integratedasynchronously.

Block dictionaries, block identifiers (e.g., in a block vector or columnof block vectors), and block offset values may be stored, and searchesmay be enabled on the block dictionaries.

A size of each block of the value identifiers may be a fixed number ofrows.

A column of data may be sorted with other columns in a table ofstructured data. The sorting may include sorting the column of data togenerate groupings of value identifiers, and selectively sorting blocksof succeeding columns based on previous columns, where a block of asucceeding column is sorted if a block of a previous column has asingle, same value identifier.

A block identifier may be assigned for each of the value identifiers. Anordering of block identifiers may match an ordering of the valueidentifiers. Block identifiers may include a numbering sequence thatcommences for each block. Each block dictionary may be compressedaccording to a binary coding such that a minimal length of bits is usedto represent block identifiers for each block dictionary. For a blockdictionary, block identifiers may only exist once for each unique valueof the block identifiers.

The subject matter described herein can be implemented to realize one ormore of the following advantages. Efficient processing of mass amountsof database data, such as relational tables containing mass amounts ofdata, may require a high level of data compression to retain datavolumes in installed memory (e.g., volatile memory) or on disk storagedevices, and for efficient data flows when moving the data (e.g., from ahard disk drive to memory). Compression may have multiple effects in aninformation processing hardware landscape because reduced data volumesmay require less installed main memory or hard disk capacity, andreduced data flows may place lower demands on processor cache, processorarchitectures, and on network bandwidth. All this may have a beneficialeffect on hardware requirements, response times, and overall systemperformance. Data, such as business data, may be compressed and searchedin memory, as significant compression of the data may allow forcost-effective in-memory processing of the data (e.g., a number ofservers or amount of physical memory may be reduced). Compression may beachieved by generating vectors of block identifiers and compressed blockdictionaries for one or more blocks of a column of data. Advantageously,multiple, frequently occurring values of a column of data may berepresented in a compressed fashion by a combination of blockdictionaries and vectors representing occurrence of values in blocks. Tominimize space occupied by a block dictionary, a minimal amount of bitsmay be used to code block identifiers in a block dictionary. Blockdictionaries may be reused across multiple blocks of data to decreasememory consumption of block dictionaries. Multiple columns of data maybe compressed using block dictionaries and vectors of block identifiers.For structured data, data dependency relationships may be maintained bysorting columns of data based on other columns of data for which thereare data dependencies.

Details of one or more implementations are set forth in the accompanyingdrawings and in the description below. Further features, aspects, andadvantages will become apparent from the description, the drawings, andthe claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A is a block diagram illustrating a table of structured data, adictionary for a column of the table, an attribute table, and a mainindex.

FIG. 1B is a block diagram illustrating a table of structured data, adictionary for a column of the table, an attribute table, and a deltaindex.

FIG. 1C is an example illustrating generation of result sets from mainand delta indexes.

FIGS. 2A-2B are block diagrams illustrating tables of structured data.

FIG. 3 is a table illustrating a cardinality of attributes and keyfigures.

FIGS. 4A-4B are block diagrams illustrating columns being compressedaccording to vector-based compression.

FIG. 5 is a block diagram illustrating a system to compress data andsearch compressed data.

FIG. 6 is a block diagram of a table illustrating a sorting of dataacross multiple columns.

FIGS. 7A-7B are flowcharts illustrating processes of compressing dataand enabling searches of compressed data.

FIG. 8 is a block diagram illustrating compression of a column of datausing compressed dictionaries.

FIG. 9 is a block diagram of a table illustrating a sorting of dataacross multiple columns.

FIG. 10 is a flowchart of a process of compressing data.

Like reference numbers and designations in the various drawings indicatelike elements.

DETAILED DESCRIPTION

In general, in FIGS. 1-10, data may be compressed using a combination oftechniques, which may be referred to as dictionary-based compression,bit vector compression (or, vector-based compression), sorting bitvector compression (or, shortened vector-based compression), and blockvector compression. The data may be structured business data, where thedata is structured in the sense that data may be attributes or keyfigures which are organized in a data structure, such as a table, andattributes or key figures may have dependencies. For example, in a tableof information, a row may have dependencies among data in the row suchthat data in each of the columns of the row is associated with otherdata in other columns of the row. The data may form a sparsedistribution in the sense that particular values such as null values ofdata may be instantiated exceedingly often across thousands or millionsof rows either in part of the data structure, such as in certain rows ina table, or across the entire data structure. For example, a column ofdata having twenty million entries may include nineteen million nullentries, where the nineteen million null entries are located in variousrows that are not necessarily adjacent.

FIG. 1A is a block diagram illustrating a table 105 of structured data,a dictionary 110 for a column of the table, an attribute table 115, andan index 120. In general, FIG. 1A illustrates how a column 125 in thetable 105 may be provided with the dictionary 110 that specifies valueidentifiers 130 (ValueIds) for the values in the column 125, theattribute table 115 listing value identifiers 135 for respectivedocument identifiers 140 (DocIds), and the index 120 listing documentidentifiers 145 for respective value identifiers 150.

The dictionary 110 may be used to provide what may be referred to asdictionary-based compression, which may involve using the dictionary 10to reduce an amount of data stored in a table by representing values inthe table with identifiers that may take up less memory. In general, thedictionary 110 is a list, which may be sorted, of values appearing in acolumn and identifiers of the values (i.e., the value identifiers).

As an example, to reduce the memory or disk space occupied by a columnfrom a data table by means of dictionary-based compression, a sortedlist of different values appearing in a column may be generated and thedifferent values may be numbered. The numbers (implemented, for example,as integers rather than strings that may represent the valuesthemselves) may be used as placeholders for the values in the tableswhere the values appeared. The largest numbers needed to represent thevalues may be noted. If the cardinality C of a column is defined to bethe number of different values appearing in it and if the total lengthof the column is N, then in a case where the value of C is much lessthan N, dictionary-based compression may bring benefits, such as reducedmemory consumption in contrast to storing of the values in the table. Asorted list of C values may be referred to as a dictionary and may beused to look up the values of the numbers appearing in the tablewhenever these values need to be determined, for example, to returnreadable results to a user.

For example, the table 105 includes a column 125, which has values suchas INTEL, ABB, and HP. The dictionary 110 includes value identifiers120, which may represent the different values that may be in the column125. For example, the attribute table 115 includes a value identifierfor each row of the column 125. For example, the first row 160, referredto as document identifier 1 (“DocId 1”), includes a value INTEL that isrepresented in the attribute table 115 with the value identifier 4 basedon the dictionary 110 having the value identifier 4 associated with thevalue INTEL. Values of the column 125 of the table 105 may be replacedwith the value identifiers of the attribute table 115, which may reducememory footprint of the data represented by the column 125. The valueidentifiers in the new table, along with the dictionary 110 may be usedto reconstruct to the values of the column 125.

To facilitate value lookup and thus render contents of the column in aform that is more adapted for query execution, the index 120 may begenerated and that index may replace the column 125. The index 120 is atable of lists of rows of the column 125 organized by their valueidentifier. For example, one list 155 indicates that a fourth valueidentifier is associated with rows 1, 4, and 8 of the column 125.

As an example of a memory impact of a table, a number of rows in a tableT, such as the table 105 of FIG. 1A, may be equal to N being 1,000,000;and a number of bytes required to code each row may be 500, where 500bytes is equal to 4,000 bits, which is equal to 4 kilobits. Withoutcompression, the table T may require 1,000,000 times 500 bytes of space,which is equivalent to 500 megabytes of memory space; and, bandwidthrequired to move the table T in one second may be 4 gigabits per second.

As an example of how the data may be organized, a number of differentvalues in column A of table T may be equal to a cardinality C of A being250. In that example, the dictionary for column A may be a list of Cvalues numbered by integers from 0 to 250. The binary representation ofintegers up to 250 may require eight bits (since two to the eighth poweris equal to 256) which is one byte. For an example table T consisting often columns similar to column A, an average uncompressed column entry inany column occupies fifty bytes (500 bytes divided by ten columns isfifty bytes per column). The dictionary for column A may require about100 kilobits (from the fact that 250 entries×(1 one byte valueidentifier plus 50 bytes to represent the corresponding value in adictionary entry) is about 12 kilobytes, which is about 100 kilobits).Thus, with dictionary-based compression, column A may occupy 1,000,000bytes, which is about one megabyte with a total space required by columnA with dictionary-based compression being about 1.01 megabyte (space ofcompressed column plus corresponding dictionary); whereas a nominalspace occupied by column A without compression may be 50 megabytes. Sothe compression factor may be about 50.

FIG. 1B is a block diagram illustrating a table 170 of structured data,a dictionary 172 for a column of the table, an attribute table 174, anda delta index 176. In general, features of the block diagram of FIG. 1Bmay operate similarly to features of FIG. 1A. For example, both of thetables 105, 170 store records that may be compressed withdictionary-based compression using the dictionaries 110, 172 of FIGS. 1Aand 1B, respectively, as described above.

In contrast to the block diagram of FIG. 1A, the block diagram of FIG.1B includes the delta index 176, which may be used to store changes,which may include additions, modifications, or deletions, to data of acolumn. In particular, the delta index 176 includes a result of changesto data in a compressed column. The dictionary values of the delta index176 may be chronologically ordered, as may be a case of a typical deltaindex. The chronological order may reflect an ordering of changes madeto data over time, as indicated by associations of the values in thedelta index 176 with indicated document identifiers. For example, thelist 178 of document identifiers 1, 4, and 8, which are associated withthe value identifier 3 may indicate that document identifier 1 was addedbefore the document identifier 4, which was added before the documentidentifier 8. The chronological ordering of dictionary values of thedelta index 176 may advantageously enable incremental writes withoutrevision of previous entries.

The table 170 of FIG. 1B may reflect changes to, or deltas of, the table105 of FIG. 1A. For example, each of the rows of the table 170corresponding to the delta index 176 may represent a row to add to thetable 105 corresponding to the main index 120. As another example, wherea row in the table 170 has a same record identifier as a row of thetable 105, the row in the table 170 corresponding to the delta index 176may represent a replacement of the row in the table 105. As anotherexample, a row in the table 170 may represent a difference between a rowhaving a same record identifier in the table 105 (e.g., a positive ornegative incremental difference in values in various columns of a samerecord).

Implementations of the delta index 176 may differ. For example, althoughFIG. 1B shows the delta index 176 as including dictionary-basedcompression of delta values, that need not be the case.

FIG. 1C is an example illustrating generation of result sets from mainand delta indexes. For example, the delta index 176 may be used inconjunction with an index of compressed data of a column (e.g., such asthe main index 120 of FIG. 1A), where the delta index 176 may storechanges to data in the index. Both the delta index 176 and an index ofcompressed data of a column may be searched and results from the twosources may be merged to produce a composite result which reflects thechanges made to the compressed data (e.g., as discussed below withreference to a delta buffer).

In the example, total revenue for a company (labeled “IBM”) iscalculated by finding the total revenue from the main index andincrementing it with the total revenue from the delta index. Inparticular, the command 180 “RETURN TOTAL REVENUE FOR IBM SALES” may bebroken into two operations, by a server program, where the twooperations include a first operation 182 for revenue for IBM from a mainindex (rows 5 and 6 of FIG. 1A correspond to a value identifier of 3,for “IBM,” in the main index, having a total value of 11 euros) and asecond operation 184 for revenue for IBM from a delta index (rows 5 and6 of FIG. 1B correspond to a value identifier of 4, for “IBM,” in thedelta index, having a total value of 10 euros). And, results from thosetwo operations may be merged by an operand 186, where merging mayinclude incrementing the results of the main index with the results ofthe delta index. Increments may be positive or negative. Updates toentries in the main index may be handled by first canceling an old rowthen inserting the updated row, where the cancelation can in oneimplementation be represented by a negative increment in the delta indexand the insert by a positive increment.

FIGS. 2A-2B are block diagrams illustrating tables 202, 204 ofstructured data. In general, a first table 202 represents animplementation of a sales table not being compressed according todictionary-based compression and a second table 204 represents andimplementation of the first table 202 being compressed in accordancewith dictionary-based compression.

The first table 202 includes columns representing different types ofdata and rows representing different records based on a combination ofthe different types of data. For example, the columns represent salesnumbers 206, dates 208, location codes 210, product code 212, a numberof products sold 214, packaging attributes 216, a currency unit 218, atotal value in cents 220, and an invoice number 222. The first row 224includes a combination of information being a record, including, a salesnumber S2551, a date of 20040904, a location code of L164, a productcode of P21191, and the like.

The second table 204 represents values of the columns 206-222 of thefirst table as dictionary-based compressed values. The types ofcompressed values include attributes, but not key figures (for reasonsthat may be irrelevant to compression of the values), and attributes arerepresented by identifiers in dictionaries 232. For example, values ofthe sales numbers 206 of the first table 202 are compressed as six digitinteger values (which for up to about 250 thousand values may berepresented by eighteen bit integer identifiers) in the first column 228of the second table 204, where those integer values may represent avalue in a first dictionary 230 but the number sold key figure values214 of the first table 202 are not represented in a dictionary (and arerepresented as floating-point numbers for reasons that need not berelevant to compression of the values). A first sales identifier 0000 inthe first dictionary 230 represents a value S2500 of the first column228 of the second table 204.

Although FIGS. 2A-2B include a certain type of dictionary-basedcompression, such compression may differ. For example, although in thesecond table 204 of FIG. 2 key figures are not compressed according todictionary-based compression, in some implementations a combination ofkey figures and attributes may be compressed according todictionary-based compression, only key figures may be compressed, or acombination of select key figures and attributes may be compressed.

FIG. 3 is a table 300 illustrating cardinalities of attributes and keyfigures. For example, the table 300 may include a listing ofcardinalities of respective columns of the first and second tables 202,204 of FIGS. 2A, 2B. In addition to illustrating cardinalities, thetable 300 includes for each cardinality the number of bits required tocode the respective columns. For example, the number of bits required tocode column M_3 of cardinality 3 is 2, since 2 to the second power is 4,which is the smallest integral power of 2 that is greater than or equalto 3.

The columns of the table 300 include a first column 305 identifying acolumn of another table, a second column 310 indicating cardinalities ofvalues in associated columns, and, a third column 315 indicating numbersof bits required to code associated columns based on associatedcardinalities. For example, a first entry 325 of the table 300 indicatesthat a column of attributes identified as M_1 has a cardinality oftwenty four, which requires five bits to code in binary (since two tothe fifth power is the smallest integral power of two that is greaterthan or equal to twenty four).

The table 300 may be used to reduce a memory impact of a table bygenerating column widths of values based on cardinalities of theirvalues, which may be used in combination with dictionary-basedcompression.

FIGS. 4A-4B are block diagrams illustrating columns being compressedaccording to vector-based compression. The compression may be referredto as bit vector compression. In general, the compression may involvefinding a most frequent value in a column and representing theoccurrence or non-occurrence of the value using a bit vector for thecolumn. For example, a one may represent an occurrence of the value anda zero may represent a non-occurrence of the value. The compression mayfurther involve generating a number of occurrences of a frequentlyoccurring value and removing occurrences of the frequently occurringvalue from the bit vector to generate a smaller or reduced bit vector.For example, in contrast to the series of block diagrams in FIG. 4A, theseries of block diagrams in FIG. 4B further includes reducing a bitvector of sorted values to a number of occurrences of a frequentlyoccurring value and a shortened bit vector representing other values.The values in the columns of FIGS. 4A-4B may be dictionary-basedcompression values.

In the first series of block diagrams of FIG. 4A, a bit vector 406 isgenerated for a column 402 of values, as indicated by the first arrow404. The bit vector 406 is populated with zeros and ones, as indicatedby the second arrow 408, with zeros representing a non-occurrence of afrequently-occurring value 0000 and one representing an occurrence ofthe value. The frequently occurring value that is used to populate a bitvector may be a most-frequently occurring value or just a value thatoccurs more frequently than other values. Determining whether a valueoccurs frequently may be based on a tally of values from a scan of acolumn of data, or, statistical analysis of a value expected to occurmost frequently (e.g., a null value may be expected to be amost-frequently occurring value in a table of exceptions where onlyexceptional values are non-null). The scope of occurrence fordetermining whether a value occurs frequently may be limited to a columnof data (i.e., frequently occurring values may differ on a column bycolumn basis). The column 402 of values may be compressed, as indicatedby the third arrow 410, by removing occurrences of the most-frequentlyoccurring value. For example, the value 0000 is removed from the column402 to generate a compressed column 412. To reconstitute values from thecolumn 402 based on the compressed column 412, the bit vector 406 may beused to indicate positions of values in the compressed column 412 andpositions of a frequently occurring value.

For example, once dictionary-based compression has been performed,further compression can be achieved by deploying bit vectors forrelevant columns as follows. For a given column A containing afrequently repeated value, such as a null value, a most frequent value Fmay be found in column A and coded using a bit vector V for the column.The bit vector V may have N terms, where N is a positive integerrepresenting a number of rows in column A. If V is written as a columnbeside column A, V may include a 1 beside each appearance of value F inA and a 0 beside any other row in A. The bit vector V may be separatedfrom the column vector A and all rows including value F may be deletedfrom A to generate a compressed column vector A*. The column A may bereconstituted from the compressed column vector A* by reinserting valuesF as specified by the bit vector V, and the uncompressed column ofreadable values can be reconstituted from A by using a dictionary asspecified through a dictionary-based compression technique.

As an example of how reduction in memory may be realized, for an exampletable T with N equal to 1,000,000 rows and having a column A, let themost frequent value F in column A appear 990,000 times in A. The other10,000 values in A may be taken from the remainder of the set ofdifferent values listed in the dictionary for A, which may contain atotal of C being 250 different values. In this example, the bit vector Vfor column A may contain 1,000,000 bits, which is about 1 megabit, whichis about 125 kilobytes. A compressed column A* may include 10,000entries, where each is coded with 8-bit (i.e., 1 byte) integers, to givea space occupancy of 10 kilobytes (i.e., 10,000 entries×1 byte). A totalspace required by column A without compression may be 50 MB (asdiscussed with reference to a non-compressed example column A). A totalspace required with vector compression may include space for adictionary, space for the compressed column A*, and space for the bitvector V. Total space required by a vector-based compression version ofcolumn A may be 147 kilobytes (12 kilobytes for the dictionary, 10kilobytes for a compressed column, and 125 kilobytes for a bit vector).A compression factor of about 340 may be realized (i.e., 50,000kilobytes uncompressed/147 kilobytes compressed in accordance with avector-based compression implementation).

In contrast to the first series of block diagrams of FIG. 4A, the secondseries of block diagrams of FIG. 4B involves sorting a column of data toassist in generating an amount representing a number of occurrences of afrequently occurring value. In the second series of block diagrams ofFIG. 4B, a sorted version of a column 414 is generated as shown by afirst arrow 416 to the sorted column 418. Then, a frequently occurringvalue is represented by a bit vector 422 and the sorted column 418 hasthe frequently occurring value replaced, as shown by a second arrow 420to the bit vector 422 and the reduced column 424, which has a frequentlyoccurring value 0000 removed. A number 428 representing an amount ofoccurrences of a frequently occurring value is generated as shown by athird arrow 426. In addition, the bit vector 422 is reduced to remove agrouping of the frequently occurring value to generate the reduced, orshortened, bit vector 430. As some data might not be sorted to the frontor top of bit vector 422, the shortened bit vector 430 may be used todetermine whether a frequently occurring value occurs later in thereduced column 424. For example, dependencies among data across variouscolumns, sorting rules, or a combination of factors might prevent avalue of a column from being sorted to a grouping of a frequentlyoccurring value. To reconstitute a full column, the number 428 ofoccurrences of the value in a grouping and the shortened bit vector 430may be used, in combination with the reduced column 424.

For example, once a table has been compressed by means ofdictionary-based compression and vector-based compression, a furtherlevel of compression may be possible in cases where many columns havemany instances of frequently occurring values (e.g., many null or zerovalues). Rows in the table may be sorted to bring as many of themost-frequently occurring values in columns to a top of their columns aspossible (e.g., as described with reference to table 600 of FIG. 6), andbit vectors may be generated for the columns. The topmost blocks offrequently occurring values may be replaced in the bit vectors bynumbers recording how often frequently occurring values appear in theblocks (e.g., as shown by the number 428 of FIG. 4B). The use of thenumber may allow for bit vectors to be shortened and an increase of anoverall compression ratio.

As a more detailed example, an example table T may have M columns with amost-frequently occurring value in column 1 being F_1, a most-frequentlyoccurring value in column 2 being F_2, and the like to F_M, where anumber of occurrences of a value F_J in a column J may be written as|F_J| and the column J may be any of the columns (i.e., any of thecolumns from 1 to M). The columns may be numbered such that a columnordering is given by the frequency of the most frequent values F, sothat the column with most F values is first and the column with fewest Fvalues is last. Thus, columns 1 to M may be numbered such that|F_1|>|F_2|> . . . >|F_M| (e.g., as may be the numbering of columns 602in table 600 of FIG. 6).

The rows of table T may be sorted by column 1 such that all values F_1are on top. The sort order may be indifferent to an internal ordering ofthe top |F_1| rows, which may be sorted by column 2 such that all valuesF_2 are on top. The sort order may now be indifferent to an internalordering of the top block of rows with value F_2, such that these rowsare sorted by a column 3 such that all values F_3 are on top. Thesorting of rows may continue until a topmost block of rows with valueF_(M−1) are sorted to put values F_M on top (e.g., continued for all butthe last column M). All the F_1 rows may be on top, with a large numberof the F_2 rows on top, somewhat fewer F_3 rows on top, and so on, withdecreasing completeness (e.g., as depicted in table 600 of FIG. 6). Thismanner of sorting might not be a theoretically optimal sorting formaximizing a final compression ratio, but may be relatively easy toimplement, such that it runs faster than a more complex approach (e.g.,more efficiently utilizes processing resources), and may often be closeto an optimal sorting.

Follow the detailed example, bit vectors V_1 to V_M may be written forthe columns 1 to M, where each bit vector V_J for a column J includes‘1’ values for an occurrence of values F_J and ‘0’ values for any othervalue. The result may be a set of bit vectors V_J that each start with asolid block of values F_J. For each V_J, the solid block of values F_Jand write in their place a number n_J recording how many bits weredeleted in V_J. For sparse tables T (i.e., tables having a frequentoccurrence of a most-frequent value where instances of the value are notnecessarily adjacent), space occupied by a shortened bit vectors V*_Jplus the numbers n_J may be significantly less than space occupied by afull bit vectors V_J.

Shortened vector-based compression may greatly improve efficiency ofaggregation of columns of values. For example, in cases where all thefrequent values F_J are zero, aggregating the values of initial segmentswith a length n_J of the columns may be trivial (e.g., as n_J×zero iszero), and code exploiting this triviality may be much cleaner andfaster than would have been the case without the shortened vector-basedcompression.

As an example of how much compression may be realized, let a table Tagain be as described above, with N being 1,000,000 rows; the tablehaving 10 columns A_1 through A_10; a most frequent value F_1 in columnA_1 appearing 990,000 times; and another 10,000 values in A_1 containinga total of 250 different values, each at 1 byte. Without shortenedvector-based compression, but with vector-based compression column A*_1for a most-frequently occurring value F_1 may occupy 10 kilobytes and abit vector V_1 may occupy 125 kilobytes.

With shortened vector-based compression, a shortened bit vector V*_1 maycontain 10,000 bits and occupy 1.25 kilobytes. The number n_1representing the block of 1 bits for V_1 for the sorted column A_1 maybe in decimal notation 990,000 and require 20 bits in binary notation(i.e., less than 3 bytes). Total space required by A_1 compressed withshortened vector-based compression may include space for a dictionary,space for the short column A*_1, space for the short bit vector V*_1,and space for the number n_1. So the space for A_1 with shortenedvector-based compression may be less than 27 kilobytes (12 kilobytes, 10kilobytes, 1.25 kilobytes, and 3 bytes).

As described above, total space required by a column A_1 withoutcompression may be 50 megabytes. With shortened vector-basedcompression, a compression factor may be greater than 1800 (50,000kilobytes/27 kilobytes, suitably rounded). Compression factors for othercolumns having shortened vector-based compression, such as columns A_2to A_10 may be less, depending on how many of the values F_J for J beingfrom 2 to 10 are sorted to the tops of their columns, but for sparsetables the overall compression factor can still be high enough to makesuch compression well worthwhile, even allowing for the overhead coderequired to reconstruct the columns and read out selected values ondemand.

For both vector-based compression and shortened vector-basedcompression, some overhead resource consumption (including but notrestricted to processing and memory consumption) may be required tocompress and decompress columns, and to facilitate efficient reads ofvalues in a column without decompressing an entire column. Theadditional overhead may take both space (including but not restricted tomain memory space) and time (e.g., measured in terms of percentageutilization of processor core resources) to run, and the penalty mightbe considered in setting thresholds for an employment of shortenedvector-based compression (e.g., in contrast to vector-based compression,dictionary-based compression, or neither). The thresholds may be setheuristically and by testing on various tables. In implementationsinvolving tables containing mass amounts of data, a selection ofcompression techniques may be used (e.g., different techniques fordifferent columns) to minimize memory consumption in lieu of processingconsumption. By minimizing memory consumption, fewer hardware resources,such as a number of blade servers and an amount of installed physicalmemory, may be required. In addition, overall speed of compressing dataand responding to queries may be improved as a minimal memory footprintmay allow for compressing and searching of data in main memory (e.g.,volatile memory, such as random-access memory, having a response timequicker than a secondary memory, such as a hard disk drive used forpersistent storage), and processing overhead for implementingcompression may be acceptably small.

FIG. 5 is a block diagram illustrating a system 500 to compress data andsearch compressed data. The system 500 includes a search enginemanagement tool 502, hosts 504, and storage 506. In general the system500 may be used to search compressed data of the hosts 504 using thesearch engine management tool 502, and the data may be organized andcompressed by the hosts 504. In addition, the data held in memory at thehosts 504 may be persisted at the storage 506, in either compressed oruncompressed form. The search engine management tool 502 may be anintegrated service with the services that perform searching andcompression, implemented redundantly on each of the hosts 504 (forexample in such a manner as to provide mutual monitoring and backup).

The hosts 504 may be organized such that each of the hosts 504 holds aportion of rows of data. For example, a first host 508 may hold a firstmillion rows and a second host 510 may hold a second million rows.Distribution of rows of data across the hosts 504 may be uniform, whichmay improve parallel processing. For example, this may be done toimprove parallel processing to decompress, search and recompress rows ofdata across the hosts 504. As a result of the distribution of data, eachcolumn of a series of columns from one to M, with M being a positivenumber, may be split into parts one through N, with N being a positiveinteger, and one part being allocated to each host where each host isresponsible for the portion allocated to that host (e.g., responsiblefor indexing and compression or for decompression, search andrecompression of an allocated portion).

A logical index for a table of data may be stored at one of the hosts504 and that logical index may be used to determine where data residesacross the hosts 504 and to coordinate processing. For example, thefirst host 508 includes a logical index 518 that may indicate where rowsof data are located across the hosts 504. As an example of coordinatingprocessing, the search engine management tool 502 may query the firsthost 508 for results responsive to a search and the first host 508 mayuse the logical index 518 to coordinate searching of the hosts 504 andmerging results to provide to the search engine management tool 502.

The hosts 504 may be blade servers that share the storage 506. Thestorage 506 may include an index, such as a first index 512,corresponding to each of one or more tables from a database for whichthe data is compressed at the hosts 504. For example, the tables may befact tables and dimension tables for a multidimensional OLAP (OnLineAnalytical Processing) cube. The indexes may contain a logical indexwith metadata for an index structure and a set of compressed columns.For example, the first index 512 includes a logical index 514, which mayinclude metadata for the data of the hosts 504, and a set of compressedcolumns 516.

Each of the hosts 504 may be responsible for compressing rows of datafor which they are responsible. For example, the first host 508 may beresponsible for compressing rows of data in the compressed columns 520.The compression that is performed may be any of the types of compressiondescribed in this document. A compression scheme, such as shortenedvector-based compression, may be stored with each index at a host andcoordinated by a logical index in the case of a split index.

Each of the hosts 504 also includes a delta buffer. For example, thefirst host 508 includes a first delta buffer 522. The delta buffers maystore any changes to a respective index part of a host. For example, thefirst delta buffer 522 may store changes to data stored in the columns522 of data for a first part of a table. The delta buffers may be usedto store changes to data in the hosts 504 instead of requiring updatesto the data in response to each change (e.g., updates may beasynchronous to avoid hampering performance during queries on the data).That the compressed columns need not be updated for individual changesmay allow for compression to improve overall system performance to agreater degree than if changes were written synchronously to the storedcolumns. For example, processing overhead associated with integratingupdates may be reduced if updates are accumulated in the delta buffersand the accumulated updates used to update the main indexes on a lessfrequent basis, since then the overhead resources for compressing theupdated main index are also consumed on a less frequent basis. Forexample, instead of making a thousand small updates to a column indexand having to decompress and recompress the index each time, thethousand updates can be written to the delta buffer and then all writtentogether to the main index, so that only one cycle of decompression andrecompression is required, thus achieving a thousand-fold reduction inoverhead resource consumption. To find search results, the delta buffermay be searched along with the compressed data in a main index andresults from the delta buffer may be merged with results from the mainindex to generate results that incorporate changes. In someimplementations, depending on implementation and configuration details,each of the hosts 504 may also include one or more delta buffers, forexample, a delta buffer for each index.

FIG. 6 is a block diagram of a table 600 illustrating a sorting of dataacross multiple columns 602. Each of the rows 604 includes values thatare dependent across the columns 602. For example, a first row 608includes a combination of values for structure data that represents abusiness object where each of the values in the first row 608 aredependent on other values in the first row 608 such that a sorting ofthe first column 610 sorts data in the other columns such that thecombination of the values of the first row 608 is maintained. Thebusiness objects may be modeled as sets of joined tables. The modelingas a set of joined tables may be a result of business objects beingdefined in such a way that they can be modeled as sets of joined tables,so a search engine may manipulate the objects by searching over rows ofthe tables and calculating specified joins of the business objects.

The data in the table 600 may be dictionary-based compression values.The data in the table 600 may be a result of a sorting the data to groupmost-frequently occurring values in preparation of a vector-basedcompression technique. For example, the value 0 may be a most-frequentlyoccurring value for each of the columns 602 and the rows may have beensorted such that groupings of the value are generated at a top mostportion of the columns 602.

The sorting of the rows 604 in the table 600 may have taken into accounta most-frequently occurring value across the columns 602. For example, asummation row 606 indicates a number of occurrences of a most-frequentlyoccurring value for each of the columns 602. The columns 602 may havebeen ordered such that a most-frequently occurring value of the firstcolumn 610 occurs more frequently than other frequently occurring valuesof each of the columns 602, and, most-frequently occurring values ofother columns are also ordered such that values occurring morefrequently across the column are ordered from A_2 to A_9. Based on asorting of columns, the rows may be sorted to generate as manyfrequently occurring values in groups at one end of a row, and thatsorting may take into account dependencies across columns.

For example, the columns may be ordered horizontally as A_1 to A_9 byhow many 0 values they contain, as shown by the summation row 606. Thefirst column 610, labeled column A_1, may have all of its rows sorted tobring all the 0 values to the top. Then, a second column 612, labeledcolumn A_2, may have rows 1 to 19 sorted to bring the 0 values of thoserows to the top. The sorting of the second column 612 being restrictedto rows 1 to 19 may be based on maintaining the sorting order of the topmost values of the first column 610 and dependencies of data across arow of data. For example, as rows 1 to 19 of the first column 610includes the most-frequently occurring value of that column, to maintainthe grouping of the most-frequently occurring value in rows 1 to 19,only those rows may be sorted in the second column 612. This techniqueof sorting may be followed for each of the remaining columns. Forexample, the third column 614, labeled column A_3, may have rows 1 to 15sorted to bring 0 values of those rows to the top, where 0 values ofother rows may maintain their position (e.g., row 17 includes a 0value). As other examples, a fourth column 616, labeled column A_4, mayhave rows 1 to 14 sorted to bring 0 values to the top of those rows; afifth column 618, labeled column A_5, may have rows 1 to 10 sorted tobring 0 values of those rows to the top; a sixth column 620, labeledcolumn A_6, may have rows 1 to 8 sorted to bring 0 values of those rowsto the top; a seventh column 622, labeled column A_7, may have rows 1 to7 sorted to bring 0 values of those rows to the top; an eight column624, labeled column A_8, may have rows 1 to 6 sorted to bring 0 valuesof those rows to the top; and, a ninth column 626, labeled column A_9,may have rows 1 to 4 sorted to bring 0 values of those rows to the top.

Each of the sorted columns may be compressed using vector-basedcompression, such as the shortened vector based compression describedwith reference to FIG. 4B. The sorting of columns may optimize memorysavings of compression by pushing larger blocks of frequently occurringvalues to one end of a column, such that, for example, shorter bitvectors may be generated with the sorting than may be generated withoutsuch sorting.

Although the table 600 includes a certain organization of data that maybe a result of a sorting, sorting may differ and the data may differ.For example, although a same value, 0, is a most-frequently occurringvalue for each of the columns 602, tables may differ and differentvalues may be a most-frequently occurring value for each of the columns602. As another example, a most-frequently occurring value for onecolumn may be used for sorting all of the columns, or all of the columnsneed not be sorted.

FIGS. 7A-7B are flowcharts illustrating processes 700, 702 ofcompressing data and enabling searches of compressed data. The processes700, 702 may be implemented by the hosts 504 of FIG. 5. For example eachof the hosts 504 may perform the process 700 on a portion of data forwhich a host is responsible. The data that is compressed may bestructured business data. The data may be compressed and searched inmemory, as significant compression of the data may allow forcost-effective in-memory processing of the data (e.g., a number ofservers or an amount of physical memory may be reduced).

In general, in the process 700 of FIG. 7A, dictionary-based values ofcolumns of data in memory are sorted (704), vectors representingfrequently occurring values of the columns are generated (706), numbersrepresenting frequently occurring values are generated (708), and thenumbers and shortened vectors are stored (710).

Sorting of dictionary-based values (704) may involve sorting from alowest value to a higher value for one or more columns of values. Ifvalues of multiple columns are sorted, the sorting may involve sortingsome columns before others based on an ordering of columns (e.g., anordering based on a number of most-frequently occurring values incolumns of a table). For example, the sorting of columns 602 based onthe order of the columns 602 described with reference to the table 600of FIG. 6 may be performed. The sorting may take into accountdependencies of data across columns. For example, the dictionary-basedvalues may represent data that is structured with dependencies acrosscolumns for a same row and an association of values in a row may bemaintained. The sorting may take place in a server, such as one of thehosts 504 of FIG. 5.

For example, sorting dictionary-based values may involve, for eachcolumn of a table, ordering columns such that a column including amost-frequently occurring value (MFOV) of any column is ordered firstand sorting of other columns is based on records within the topmostrange of a previous column (714).

Vectors representing frequently occurring values of the columns aregenerated (706). The vectors may be bit vectors that represent theoccurrence or non-occurrence of a frequently occurring value in a rowwith a bit. The vectors may be generated for all of the columns or foronly some of the columns. The vectors may be generated for parts of acolumn by each server responsible for a portion of data (e.g.,respective hosts of the hosts 504 of FIG. 5). A frequently occurringvalue might be, but need not be, a most-frequently occurring value, suchas a most-frequently occurring value within a scope of a column. Forexample, a bit vector representation for a most-frequently occurringvalue of each column may be generated (716).

Numbers representing frequently occurring values are generated (708).Each of the numbers may represent a number of occurrences of afrequently occurring value in a column. For example, FIG. 4B includes anumber six (428) indicating six occurrences of a frequently occurringvalue 0000. The number of occurrences may be limited to a number ofoccurrences in a group of the frequently occurring value at one end of acolumn (e.g., at a top or bottom). For example, one column may have agroup of a frequently occurring value at a top, the number may representthe number of occurrences of the value in that group, and the column mayinclude other instances of the value. For example, a number representinginstances of a most-frequently occurring value at a topmost section ofeach column may be generated (718).

Numbers and shortened vectors are stored (710). For example, the numberrepresenting occurrences of a frequently occurring value may be storedwith a shortened vector. For example, a set of a number and a bit vectormay be stored for each column of a table. The shortened vector may be avector representing occurrences of the frequently occurring value withthe occurrences of the value represented by the number removed from thevector to generate the shortened vector. In addition to storing theshortened vector, instances of the frequently occurring value in thegroup of the value may be removed from a column to generate a shortenedor reduced column. For example, top ends of bit vectors includingmost-frequently occurring values may be removed (720), and numbersrepresenting the number of occurrences of most-frequently occurringvalues and shortened bit vectors for each column may be stored (722).Also, the top end of a column including a group of most-frequentlyoccurring values may be removed or all instances (e.g., topmost andother instances, if any) of a most-frequently occurring value may beremoved to generate a shortened column (and, e.g., may be reconstitutedusing the bit vector).

In general, in addition to implementations of sub processes of process700 of FIG. 7A, the process 702 of FIG. 7B further includes generatingdictionary-based compression values (712; e.g., as described withreference to FIG. 1A), shortening vectors representing values of columns(720; e.g., as described in the above paragraph), storing a compressedcolumn in memory (722), and decompressing and recompressing a column asrequired to execute queries (724).

Performing a search on a compressed column may involve loading data of acompressed column into memory (e.g., non-volatile memory from compresseddata in a persistent storage), decompressing the data to a temporarystructure, and selecting rows as specified by the search.

Although the processes 700, 702 of FIGS. 7A and 7B include certainsub-processes in a certain order, additional, fewer, or differentsub-processes may exist and those sub-processes may be in a differentorder. For example, a column may be sorted based on bit vectorrepresentations of most-frequently occurring values of a column, ratherthan sorting an entire column (e.g., only values most-frequentlyoccurring may be sorted to one end of a column and other values need nothave a sorting among them).

As another example, dictionary-based compression, general vector-basedcompression, and shortened vector-based compression may be applied basedon determinations as to whether a type of compression, if any, isexpected to optimize performance (e.g., reduce memory consumption). Forexample, dictionary-based compression might be performed ifdictionary-based values and a dictionary are expected to consume lessmemory than non-dictionary-based values of a column. As another example,only rows having attributes but not key figures may be compressed usingany type of compression.

FIG. 8 is a block diagram illustrating compression of a column of datausing compressed dictionaries. The diagram includes columns 802, 806 ofcompressed data, a vector 810 of block identifiers, columns 814 of blockdictionaries, and a column 820 of offset values corresponding to theblock dictionaries.

The compression illustrated in FIG. 8 may be referred to as “blockvector compression,” as blocks of data may be compressed to one or morevectors of block identifiers. The compression may be referred to asbeing blockwise as compression of data is applied to blocks of data,such as blocks of rows. The compression may use dictionary-basedcompression and might be an alternative to the techniques ofvector-based compression and shortened vector-based compressiondescribed above. The compression described with reference to FIG. 8 maybe applied as an alternative when one or more columns contain severalvalues that are each repeated many times in a respective column; incontrast to, for example, having only one value repeated many times. Thecompression may be particularly applicable for compressing millions orbillions of rows, where memory efficiencies achieved may significantlyoutweigh detriments caused by processing overhead for the compression.

In general, the compression of FIG. 8 involves dividing a sorted columnof data into blocks and creating a vector of block identifiers referringto a block dictionary for each block. A block dictionary may containjust the values that appear in the block. Block identifiers may be codedusing a minimal number of bits (e.g., binary coded with a bit lengthbeing a minimal amount required to binary code all unique blockidentifiers of a block). A block dictionary may map each blockidentifier to a value identifier representing a unique value in a columnof data. Value identifiers may be looked up in a column dictionarycreated by dictionary-based compression (as described above withreference to dictionary-based compression).

Referring to FIG. 8, a first column 802 of compressed data includesvalue identifiers representing data that has been compressed inaccordance with dictionary-based compression. For example, a valueidentifier “0007” may represent a character string “INTEL.” The firstcolumn 802 of compressed data may be sorted to generate a second column806 of compressed data, as indicated by the arrow 804. The second column806 of compressed data is sorted such that same value identifiers aregrouped together. Although the sorting of the second column 806 ofcompressed data includes a sorting from lower-numbered value identifiersto higher-numbered value identifiers, other sorting techniques may beused which result in multiple value identifiers being grouped together.

A vector 810 of block identifiers is generated based on the secondcolumn 806 of compressed data, as indicated by the arrows 808. Thevector 810 includes a block identifier for each value identifier of thesecond column 806 of compressed data. The vector 810 may be generated byassigning a unique block identifier for each unique value identifier,and assigning a same block identifier for same value identifiers, wherethe block identifiers and value identifiers are at least unique withinthe scope of a block of data (which, in this example, is a block ofvalue identifiers). Each block identifier may be an integer having aminimal binary coding based on an amount of block identifiers for ablock. The vector 810 may be a collection of vectors for each block,where a series of block identifiers for a block forms a vector for thatblock.

For example, in FIG. 8 there is a block size of three rows (in practice,a block size might be hundreds or thousands of rows for a certaincategory of data tables; a size of a block may be tested for efficacy orchosen arbitrarily). For a first block 828 of value identifiers, allvalue identifiers are the same (i.e., “0000”), such that a same blockidentifier (‘0,’ in this case) is included in the vector 810. For asecond block 830, there are two unique value identifiers (“0001” and“0002”) such that two unique block identifiers are used for that block(‘0’ and ‘1’) to represent those values in the vector 810, where thesequence of value identifiers 0001, 0001, 0002 is represented as thesequence of block identifiers 0, 0, 1. For a third block 832, thesequence of value identifiers 0002, 0003, 004 are represented as thesequence of block identifiers 00, 01, and 10 in the vector 810, as eachof the value identifiers is unique in the scope of the block. Incontrast to the second block 830, for the third block 832 two bits areused to code the block identifiers in the vector 810, as the blockidentifiers are represented using a minimal amount of bits for eachblock of block identifiers.

Although the vector 810 includes entries for a block vector having asame repeated value, a block vector of the vector 810 may containnothing at all, by convention, to give a total size of 0 kilobits. Werethe vector to have a block size of 1,024 rows, for a 1024-row blockcontaining two different values, a block identifier vector may contain aseries of 1,024 bits, each 0 or 1, to give a total size of 1 kilobits.For a 1,024-row block containing 1,024 different values, the blockidentifier vector may contains a series of 10-bit integers, to give atotal size of 10 kilobits.

The columns 814 of block dictionaries include a column 816 of blockidentifiers and a column 818 of value identifiers. In the columns 814,each block is represented by a block dictionary, where a blockdictionary consists of a combination of block identifiers and associatedvalue identifiers.

The column 816 of block identifiers is generated based on the vector 810of block identifiers as indicated by the arrows 812. In contrast to thevector 810 of block identifiers, the column 816 of block identifiersincludes a minimal amount of block identifiers for a block. The amountis minimal in the sense that for a scope of a block, each utilized blockidentifier appears only once in the listing in column 816 for thatblock. Moreover, the block identifiers for a block that are listed incolumn 816 are listed in a canonical ordering, such as a numericalordering. For example, the first block 828 of value identifiers that wasrepresented in the vector 810 as the sequence 0, 0, 0 is represented asa single block identifier ‘0’ in the column 816 of block identifiers. Asanother example, the third block 832 of value identifiers that isrepresented as the sequence 00, 01, and 10 in the vector 810 isrepresented as the block identifiers 00, 01, and 01 as each blockidentifier is unique in that block.

The column 820 of offset values corresponds to the block dictionaries ofthe columns 814 of the block dictionaries. The column 820 of offsetvalues includes an offset value for each block, and each offset valueindicates where, in the columns 814 of block dictionaries a dictionaryfor a block of data starts. For example, a first offset value 834indicates that block identifiers for the first block of compressed datastarts at a zero offset. As another example, a third offset value 836indicates that block identifiers for the third block of compressed datastarts at an offset of three (i.e., three rows from the top of thecolumn 816 of block identifiers). The column 822 of block identifiers isthe same as column 810 and is repeated to illustrate the correspondencebetween the offset values of the column 820 of offset values and blockidentifiers of the column 822 for each block, and how block identifiersof the block vectors in the column 822 of block vectors may beassociated with the offset values of the column 820 of offset values.The offset values may greatly facilitate fast value lookup and may bemandatory if block dictionaries are reused. For example, for a blocksize of one K or 1,024, scrolling down column 820 of offset values mayenable blocks to be located a thousand times faster. In addition, offsetvalues may be a practical way to enable reuse of a block dictionary.

To save space, such as volatile memory space of a server, only thecolumns 814 of dictionaries of block identifiers, the column of offsetvalues, and the vector 810 of block identifiers need be saved in lieu ofa column of value identifiers, such as the column 802 of valueidentifiers.

The compression discussed with reference to FIG. 8 or similar techniquesmay be implemented at one or more servers, such as the hosts 504 of FIG.5. The technique may run in parallel on different columns of data (e.g.,with a column at each of the hosts 504).

The compression is advantageous as, for example, if a block contains asingle value repeated 1,024 times where a block size is 1,024, and thesingle value is listed in a block dictionary for that block, then zerobits may be required to code the fact that the value is repeated (e.g.,as only a block identifier ‘0’ and an associated value identifier mayrepresent the 1,024 instances of the value in the block and no blockvector may exist, by convention). As another example, if a blockcontains two values (e.g., two values unique within the scope of theblock) and these two values are listed in a block dictionary for thatblock, then one bit is required to specify for each row in the blockvector in which the value occurs (e.g., 0 or 1 for a list of entries ina block vector). As another example, if a block contains three or fourdifferent values, again listed in a block dictionary, then two bits arerequired to specify for each row in the block vector in which the valueoccurs (e.g., 00, 01, 10, or 01). Thus, in general, if a block containsN different values such that the smallest integral power of two which isgreater than or equal to N is 2 to the power of P, then P bits arerequired to specify for each row in the block vector which value in theblock dictionary it takes. In a worst case scenario, if a block contains1,024 different values, then 10 bits are required to specify for eachrow which value it takes (since 1,024 is equal to two to the tenthpower).

As another example of a block dictionary for block vector compression,assuming a column (e.g., the column 806 of value identifiers) has acardinality of C, where C is the number of different values in theentire column, and many of these values may repeat many times such thatblock vector compression may be applicable. If C is between 524,288(i.e., 2^19) and 1,048,576 (i.e., 2^20) the value identifiers for acolumn may require 20 bits.

For a 1,024-row block containing a single repeated value, the blockdictionary may contain a single entry specified by 20 bits. For a1,024-row block containing two different values, the block dictionarymay contain two block identifier bits and two 20-bit value identifierbits, to give a total of 42 bits (i.e., 20 bits for each valueidentifier and 1 bit for each block identifier). For a 1,024-row blockcontaining 1,024 different values, the block dictionary may contain1,024 block identifiers, each with 10 bits, and 1,024 value identifiers,each with 20 bits, to give a total of 30 kilobits (i.e., 30 bits times1,024 entries).

In each case, a few more bits may be required to code an offset, where anumber of bits depends on a length of an entire block dictionary. Addingblock vector bit totals with the block dictionary bit totals for theabove cases gives, respectively, 20 bits, 43 bits, and 40 kilobits forthe three example cases considered. A worst-case scenario total usingdictionary-based compression and bit vector compression together may betwice a total space required using only dictionary based compression fora column, which indicates that the bit vector compression may only beadvantageously applicable for columns where a great majority of blocksinclude many repeated values.

To find a user-readable value associated with a given block identifier,there may be two dictionary lookups, although both lookups may rarely berequired to be performed. For example, in response to a query for allrecords having a given value, first a dictionary for a column may beused to look up a value identifier associated with a value (such as akey figure or attribute), and then a block dictionary may be used tolook up a block identifier associated with the value identifier in agiven block. As another example, from a block identifier to a value, ablock dictionary 814 may be used to look up a value identifiercorresponding to a block identifier and then the dictionary for column802 may be used to look up the value corresponding to the valueidentifier. To look up values in multiple blocks, multiple blockdictionaries may be used to find associated block identifiers for arespective block.

Although FIG. 8 includes a certain combination of features as part ofthe compression, there may be variations. For example, block identifiersmay be unique to a block of data. As another example, block identifiersneed not be compressed to a minimal amount of bits to represent a blockof block identifiers. As another example, block identifiers need notstart at ‘0’ or be in an incremental ordering. As another example,although the word “each” is used in the description of compression ofFIG. 8, the compression need not be so absolutely applied. For example,each block need not be compressed and an offset value need not exist foreach block.

As another example, block dictionaries may be reused across severalblocks. For example, value identifiers of a block may be either the sameas for another block, such as a previous block, or very similar. In suchcases, a previous dictionary may be reused as many times as possible.

For example, if values in a new block (i.e., a succeeding block) areeither the same as in a previous block or form a subset of them, aprevious block dictionary may be used as-is, and the only codingrequired for the new block may be to set its offset to be the same asfor the previous block (as the same block dictionary is reused).

The following three conditions may be required before reuse of adictionary for a succeeding block having additional block identifiers incomparison to a previous block: a succeeding block includes one or morevalues additional to those appearing in a previous block, which may bereferred to as N additional values; a previous block includes Mdifferent values that can be coded using P bits; and a total M+N isstill less than or equal to 2^P, such that the additional blockidentifier—value identifier pairs may be appended to a previousdictionary.

The new values may be appended to a previous dictionary and the offsetfor a new block may be set to be the same as for the previous block. Inthis manner, the new entries to the previous dictionary need not disturbapplicability of the dictionary for the previous block (as the blockidentifier—value identifier pairs for the previous block do not change).If many blocks in a column are similar to previous blocks, reuse of ablock dictionary may decrease space required to store a column by alarge factor.

FIG. 9 is a block diagram of a table 900 illustrating a sorting of dataacross multiple columns 902. Sorted data in the columns 902 may be usedfor block vector compression, which is discussed with reference to FIGS.8 and 10. Columns of the table 900 may be spread across multipleservers, such as the hosts 504 of FIG. 5. In general, the table 900includes columns 902 of rows 904 of data, where one or more rows of datamay represent structured data having data dependencies across thecolumns 902. For example, business objects may be modeled as sets ofjoined tables. As the data in the table 900 has data dependencies acrossthe columns 902, sorting of a column may take into account thesedependencies and may affect a resulted sorting.

The sorted data in the table 900 may represent a precondition for blockvector compression as the data may be sorted to group as many repeatedvalues together as possible. A difficulty may be that sorting the table900 to optimize an ordering of the columns 902 sorts the entire rows, soother columns may become more disordered as a result. The sorting of thedata in the table 900 may represent a sorting where the rows 904 aresorted sufficiently well to generate a good ordering of all the relevanttable columns 902. Given various practical constraints, it may beunnecessary to optimize the technique (e.g., optimize in a mathematicalsense to maximize a number of ordered blocks), and a robust heuristicthat runs fast may be good enough for envisaged applications.

The sorting of the data in the table 900 may be one of many techniquesfor sorting data prior to block vector compression. Shaded sections ofthe table may indicate where there is no sorting in addition to sortingresulting from sorting of other sections (e.g., other columns of a sameset of rows). The data in the table 900 may be value identifiers thatare a result of dictionary-based compression.

The columns 908 include labels for rows 904 of the data, including blocklabels and row labels. For example, the first row 906 is in a block B1and is row 1.

The data in the table 900 is the result of sorting columns 902 from afirst column, to a second column, and so on down the sequence of columns902. The columns 902 are not ordered but may be in some implementations.For example, columns that are expected to or known to have fewer uniquevalues may be ordered earlier in a sequence of columns that are sorted,as a column having fewer unique values may be expected to have largergroupings of values such that block vector compression may be moreefficient than it would be for columns later in the sequence. Also,block vector compression need not be performed on all columns and mayonly be applied on select columns, such as columns that meet somespecified threshold criterion for benefiting sufficiently from blockvector compression. For example, a comparison of space requirements fora column with or without block vector compression may be made and animpact of coding and runtime overheads of block vector compression maybe made. Metrics related to space efficiency and overhead may becompared to determine whether to utilize block vector compression on oneor more columns in a particular implementation.

The data in the table 900 is a result of sorting of columns as follows.In the example, the table 900 has five columns A1 to A5 and 35 rowsdivided into seven blocks B1 to B7, each having five rows, where eachcolumn entry has four possible values, coded with value identifiers zeroto three. The columns are sorted from the first column A1 to the fifthcolumn A5, with a result of sorting of a previous column affectingsorting of a succeeding column. In particular, column A1 is sorted suchthat value identifiers are ascending. Any blocks with uniform valueidentifiers in the column A1 are available for further sorting for thesecond column A2. In the table 900, these are blocks B1, B2, B4, and B7.For example, block B1 has all entries of ‘0’ in column A1. Column A2blocks B1, B2, B4, B7 are each sorted internally such that valueidentifiers are ascending. Any blocks with uniform A2 value identifiersare available for further sorting; these are blocks B1 and B6. Column A3blocks B1 and B6 are each sorted internally such that value identifiersare ascending. Any blocks with uniform A3 value identifiers areavailable for further sorting; this includes block B1. Column A4 blockB1 is sorted internally such that value identifiers ascending. Block B1does not contain uniform A4 value identifiers so there is no furthersorting (i.e., no sorting of value identifiers in A5).

Thus, in general, sorting includes sorting a first column such that likevalue identifiers are grouped together, such as sorting in ascending ordescending order. For succeeding columns, only blocks having a samevalue identifier in a previous column are sorted, and those blocks aresorted such that there is a grouping of value identifiers. The sortingmay continue until no blocks of a previous column have a same valueidentifier.

In some implementations, sorting may have an undesired effect ofhindering table updates, which may require efficient random access totable rows. Updates may be collected and handled by separate deltaindexes (as described above), which may be periodically merged with themain table indexes (i.e., asynchronously). The merge process may involverebuilding table indexes, so a sort order as described above need nothinder table updates. In particular, it need not be a hindrance in thecase that a compression technique involving re-sorting columns isimplemented together with a buffering scheme for handling updates likethe delta index approach described above.

FIG. 10 is a flowchart of a process 1000 of compressing data. Theprocess 1000 may be implemented by the hosts 504 of FIG. 5. For exampleeach of the hosts 504 may perform the process 1000 on a portion of datafor which a host is responsible. The data that is compressed may bestructured business data. The data may be compressed and searched inmemory, as significant compression of the data may allow forcost-effective in-memory processing of the data (e.g., a number ofservers or an amount of physical memory may be reduced).

In general, the process 1000 may be referred to as block vectorcompression, which may refer to blockwise compression of the data intovectors. The process 1000, involves compressing one or more columns ofdata in accordance with dictionary-based compression to generate columnsof value identifiers (1002), sorting the value identifiers (1004),generating block identifiers (1006), generating block dictionaries(1008), and generating offset values (1010).

One or more columns of data are compressed in accordance withdictionary-based compression to generate columns of value identifiers(1002). The dictionary-based compression may use a minimal amount ofbits to represent value identifiers based on a cardinality of values ina column.

The value identifiers are sorted (1004). Sorting of the valueidentifiers may include sorting multiple columns of value identifiersand may further involve sorting columns of value identifiers based onsorted value identifiers of other columns, as, for example, describedwith reference to FIG. 9.

Block identifiers are generated (1006). The block identifiers may begenerated for each block of value identifiers in a selected column. Forexample, a determination may be made to apply block vector compressionto a column and for each block of the column block identifiers may begenerated where, for each block, a unique block identifier is generatedfor each unique value identifier in a block, and like block identifiersare used for like value identifiers.

Block dictionaries are generated (1008). A block dictionary may begenerated for each block, and for each block dictionary blockidentifiers may be associated with a value identifier where blockidentifiers may only be included for each unique block identifier withinthe scope of a block. For example, with reference to FIG. 8, a blockdictionary for the first block 828 has one, unique block identifier ‘0’(838).

Multiple block dictionaries may be included in columns of blockdictionaries. For example, with reference to FIG. 8, multiple blockdictionaries are included in the columns 814 of block dictionaries.

Block dictionaries may be reused. For example a block dictionary for oneblock may be reused for succeeding blocks that have the same blockidentifiers or have additional block identifiers (as described above).

Offset values are generated (1010). The offset values may indicate wherea block starts in columns of block dictionaries, and the offset valuesmay be associated with block identifiers in a block vector. The offsetvalues may be included in a block offset column that is associated witha vector of block vectors (e.g., as shown by the column of offset values820 having offset values associated with the column 822 of blockvectors).

Although FIG. 10 includes a certain combination and type ofsub-processes, the process 1000, block vector compression, or both mayinvolve fewer, different, or additional sub-processes. As examples, notall columns need be sorted; not all columns need be compressed accordingto block vector compression; dictionary-based compression of the columnsneed not be performed for all or any columns; sorting of columns neednot be dependent on results of sorting other columns; block dictionariesmay be reused; additional sub-processes may include receiving a query ofone or more columns of data and the block vectors may be used in concertwith the block dictionaries to find data matching criteria of the query;a delta buffer may be used and the delta buffer may be searched, inparallel, with the block vector and results from the delta buffer may bemerged with other results; block vectors, block dictionaries, and offsetvalues may be stored; offset values might not be generated; and thelike.

Although each of the figures describes a certain combination offeatures, implementations may vary. For example, additional, different,or fewer components may be included in the system 500 of FIG. 5.

The subject matter described herein can be implemented in digitalelectronic circuitry, or in computer software, firmware, or hardware,including the structural means disclosed in this specification andstructural equivalents thereof, or in combinations of them. The subjectmatter described herein can be implemented as one or more computerprogram products, i.e., one or more computer programs tangibly embodiedin an information carrier, e.g., in a machine-readable storage device orin a propagated signal, for execution by, or to control the operationof, data processing apparatus, e.g., a programmable processor, acomputer, or multiple computers. A computer program (also known as aprogram, software, software application, or code) can be written in anyform of programming language, including compiled or interpretedlanguages, and it can be deployed in any form, including as astand-alone program or as a module, component, subroutine, or other unitsuitable for use in a computing environment. A computer program does notnecessarily correspond to a file. A program can be stored in a portionof a file that holds other programs or data, in a single file dedicatedto the program in question, or in multiple coordinated files (e.g.,files that store one or more modules, sub-programs, or portions ofcode). A computer program can be deployed to be executed on one computeror on multiple computers at one site or distributed across multiplesites and interconnected by a communication network.

The processes and logic flows described in this specification, includingthe method steps of the subject matter described herein, can beperformed by one or more programmable processors executing one or morecomputer programs to perform functions of the subject matter describedherein by operating on input data and generating output. The processesand logic flows can also be performed by, and apparatus of the subjectmatter described herein can be implemented as, special purpose logiccircuitry, e.g., an FPGA (field programmable gate array) or an ASIC(application-specific integrated circuit).

Processors suitable for the execution of a computer program include, byway of example, both general and special purpose microprocessors, andany one or more processors of any kind of digital computer. Generally, aprocessor will receive instructions and data from a read-only memory ora random access memory or both. The essential elements of a computer area processor for executing instructions and one or more memory devicesfor storing instructions and data. Generally, a computer will alsoinclude, or be operatively coupled to receive data from or transfer datato, or both, one or more mass storage devices for storing data, e.g.,magnetic, magneto-optical disks, or optical disks. Media suitable forembodying computer program instructions and data include all forms ofvolatile (e.g., random access memory) or non-volatile memory, includingby way of example semiconductor memory devices, e.g., EPROM, EEPROM, andflash memory devices; magnetic disks, e.g., internal hard disks orremovable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.The processor and the memory can be supplemented by, or incorporated in,special purpose logic circuitry.

To provide for interaction with a user, the subject matter describedherein can be implemented on a computer having a display device, e.g., aCRT (cathode ray tube) or LCD (liquid crystal display) monitor, fordisplaying information to the user and a keyboard and a pointing device,e.g., a mouse or a trackball, by which the user can provide input to thecomputer. Other kinds of devices can be used to provide for interactionwith a user as well; for example, feedback provided to the user can beany form of sensory feedback, e.g., visual feedback, auditory feedback,or tactile feedback; and input from the user can be received in anyform, including acoustic, speech, or tactile input.

The subject matter described herein can be implemented in a computingsystem that includes a back-end component (e.g., a data server), amiddleware component (e.g., an application server), or a front-endcomponent (e.g., a client computer having a graphical user interface ora web browser through which a user can interact with an implementationof the subject matter described herein), or any combination of suchback-end, middleware, and front-end components. The components of thesystem can be interconnected by any form or medium of digital datacommunication, e.g., a communication network. Examples of communicationnetworks include a local area network (“LAN”) and a wide area network(“WAN”), e.g., the Internet.

The computing system can include clients and servers. A client andserver are generally remote from each other in a logical sense andtypically interact through a communication network. The relationship ofclient and server arises by virtue of computer programs running on therespective computers and having a client-server relationship to eachother.

The subject matter described herein has been described in terms ofparticular embodiments, but other embodiments can be implemented and arewithin the scope of the following claims. For example, operations candiffer and still achieve desirable results. In certain implementations,multitasking and parallel processing may be preferable. Otherembodiments are within the scope of the following claims

1. A computer program product, tangibly embodied in a computer-readablestorage medium, the computer program product being operable to causedata processing apparatus to perform operations comprising: compressinga column of data in accordance with dictionary-based compression, thecompressing comprising generating a column of value identifiers, each ofthe value identifiers representing a unique value in the column of data;sorting the value identifiers to form a plurality of blocks of valueidentifiers; for each block of the value identifiers, generating a firstlist of block identifiers, the generating comprising: having a uniqueblock identifier for each unique value identifier in a block, and havinglike block identifiers for like value identifiers; generating columns ofblock dictionaries, the generating the columns of the block dictionariescomprising for each block, generating a block dictionary, the blockdictionary comprising: a second list of block identifiers, each blockidentifier associated with a value identifier, and a block identifierexisting in the block dictionary for each unique value of the blockidentifiers in the second list; and generating a block offset column,each value of the block offset column indicating an offset at which ablock starts in the columns of the block dictionaries; wherein changesto the column of the data are stored in a delta buffer separate from thecolumn of the data and the changes are integrated asynchronously.
 2. Theproduct of claim 1, wherein the value identifiers are valuesrepresenting structured business data having data dependencies across asame row of a table.
 3. The product of claim 2, wherein the businessdata comprises business objects modeled as sets of joined tables.
 4. Theproduct of claim 1, wherein the operations of the product are performedin parallel on a plurality of hardware servers.
 5. The product of claim1, wherein the operations further comprise: storing the column of theblock dictionaries and the block offset column to enable searches on theblock dictionaries.
 6. The product of claim 1, wherein a size of eachblock of the value identifiers is a fixed number of rows.
 7. The productof claim 1, wherein the operations further comprise sorting the columnof data with other columns in a table of structured data, the sortingcomprising: sorting the column of data to generate groupings of valueidentifiers; and selectively sorting blocks of succeeding columns basedon previous columns, a block of a succeeding column being sorted if ablock of a previous column has a single, same value identifier.
 8. Theproduct of claim 1, wherein a block identifier is assigned for each ofthe value identifiers in the first list, an ordering of the blockidentifiers in the first list matches the ordering of the valueidentifiers, the block identifiers in the first list comprise anumbering sequence that commences for each block, each block dictionaryis compressed according to a binary coding such that a minimal length ofbits is used to represent block identifiers for each block dictionary,and for each block dictionary, block identifiers only exist once foreach unique value of block identifiers.
 9. A computer-implemented methodcomprising: compressing a column of data in accordance withdictionary-based compression, the compressing comprising generating acolumn of value identifiers, each of the value identifiers representinga unique value in the column of data; sorting the value identifiers toform a plurality of blocks of value identifiers; for each block of thevalue identifiers, generating a first list of block identifiers, thegenerating comprising: having a unique block identifier for each uniquevalue identifier in a block, and having like block identifiers for likevalue identifiers; generating columns of block dictionaries, thegenerating the columns of the block dictionaries comprising for eachblock, generating a block dictionary, the block dictionary comprising: asecond list of block identifiers, each block identifier associated witha value identifier, and a block identifier existing in the blockdictionary for each unique value of the block identifiers in the secondlist; and generating a block offset column, each value of the blockoffset column indicating an offset at which a block starts in thecolumns of the block dictionaries; sorting the column of data with othercolumns in a table of structured data, the sorting comprising: sortingthe column of data to generate groupings of value identifiers; andselectively sorting blocks of succeeding columns based on previouscolumns, a block of a succeeding column being sorted if a block of aprevious column has a single, same value identifier.
 10. The method ofclaim 9, wherein the value identifiers are values representingstructured business data having data dependencies across a same row of atable.
 11. The method of claim 9, wherein changes to the column of thedata are stored in a delta buffer separate from the column of the dataand the changes are integrated asynchronously.
 12. The method of claim 9further comprising: storing the column of the block dictionaries and theblock offset column to enable searches on the block dictionaries.
 13. Amethod comprising: compressing a column of data in accordance withdictionary-based compression, the compressing comprising generating acolumn of value identifiers, each of the value identifiers representinga unique value in the column of data; sorting the value identifiers toform a plurality of blocks of value identifiers; for each block of thevalue identifiers, generating a first list of block identifiers, thegenerating comprising: having a unique block identifier for each uniquevalue identifier in a block, and having like block identifiers for likevalue identifiers; generating columns of block dictionaries, thegenerating the columns of the block dictionaries comprising for eachblock, generating a block dictionary, the block dictionary comprising: asecond list of block identifiers, each block identifier associated witha value identifier, and a block identifier existing in the blockdictionary for each unique value of the block identifiers in the secondlist; and generating a block offset column, each value of the blockoffset column indicating an offset at which a block starts in thecolumns of the block dictionaries; wherein changes to the column of thedata are stored in a delta buffer separate from the column of the dataand the changes are integrated asynchronously.
 14. The method of claim13, wherein the value identifiers are values representing structuredbusiness data having data dependencies across a same row of a table. 15.The method of claim 14, wherein the business data comprises businessobjects modeled as sets of joined tables.
 16. The method of claim 13,wherein the method is performed in parallel on a plurality of hardwareservers.
 17. The method of claim 13, further comprising: storing thecolumn of the block dictionaries and the block offset column to enablesearches on the block dictionaries.
 18. The method of claim 13, whereina size of each block of the value identifiers is a fixed number of rows.19. The method of claim 13, further comprising sorting the column ofdata with other columns in a table of structured data, the sortingcomprising: sorting the column of data to generate groupings of valueidentifiers; and selectively sorting blocks of succeeding columns basedon previous columns, a block of a succeeding column being sorted if ablock of a previous column has a single, same value identifier.
 20. Themethod of claim 13, wherein a block identifier is assigned for each ofthe value identifiers in the first list, an ordering of the blockidentifiers in the first list matches the ordering of the valueidentifiers, the block identifiers in the first list comprise anumbering sequence that commences for each block, each block dictionaryis compressed according to a binary coding such that a minimal length ofbits is used to represent block identifiers for each block dictionary,and for each block dictionary, block identifiers only exist once foreach unique value of block identifiers.
 21. A computer program product,tangibly embodied in a computer-readable storage medium, the computerprogram product being operable to cause data processing apparatus toperform operations comprising: compressing a column of data inaccordance with dictionary-based compression, the compressing comprisinggenerating a column of value identifiers, each of the value identifiersrepresenting a unique value in the column of data; sorting the valueidentifiers to form a plurality of blocks of value identifiers; for eachblock of the value identifiers, generating a first list of blockidentifiers, the generating comprising: having a unique block identifierfor each unique value identifier in a block, and having like blockidentifiers for like value identifiers; generating columns of blockdictionaries, the generating the columns of the block dictionariescomprising for each block, generating a block dictionary, the blockdictionary comprising: a second list of block identifiers, each blockidentifier associated with a value identifier, and a block identifierexisting in the block dictionary for each unique value of the blockidentifiers in the second list; and generating a block offset column,each value of the block offset column indicating an offset at which ablock starts in the columns of the block dictionaries; sorting thecolumn of data with other columns in a table of structured data, thesorting comprising: sorting the column of data to generate groupings ofvalue identifiers; and selectively sorting blocks of succeeding columnsbased on previous columns, a block of a succeeding column being sortedif a block of a previous column has a single, same value identifier. 22.The product of claim 21, wherein the value identifiers are valuesrepresenting structured business data having data dependencies across asame row of a table.
 23. The product of claim 21, wherein changes to thecolumn of the data are stored in a delta buffer separate from the columnof the data and the changes are integrated asynchronously.
 24. Theproduct of claim 21, wherein the operations further comprise: storingthe column of the block dictionaries and the block offset column toenable searches on the block dictionaries.